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Quantum discord was proposed as a measure of the quantumness of correlations. There are 
at least three different discordlike quantities, two of which determine the difference between the 
efficiencies of a Szilard's engine under different sets of restrictions. The three discord measures 
vanish simultaneously. We introduce an easy way to test for zero discord, relate it to the Cerf- 
Adami conditional entropy and show that there is no simple relation between the discord and local 
distinguishability. 
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I. INTRODUCTION 

We learn about the external world from the correla- 
tions between our measuring devices and the systems 
we study. The information flow is studied in (classical) 
information theory through the analysis of correlations 
[l| . Their build-up and propagation are at the core of 
measurement theory 0, E|, while the research of their 
behavior with respect to spacetime locality led to the 
identification of quantum entanglement and was one of 
the precursors of quantum information theory 0, Q • All 
correlations — classical and quantum — are important 
in statistical thermodynamics [aL which in turn influ- 
ences the entanglement theory Q. Initial correlations 
modify the dynamics of open systems 0, Q , and a signif- 
icant part of quantum information processing is devoted 
to preservation and manipulation of useful correlations, 
while trying to mitigate the effects of the unwanted ones. 

A boundary between quantum and classical correla- 
tions is sharp in pure states, which are either simply sep- 
arable product states or entangled. It becomes less clear 
in mixed states, particularly for systems larger than a 
pair of qubits. We investigate this boundary through a 
characteristic of quantum discord [sHIll (more precisely, 
we review and define three similar, but subtly different 
discord measures). 

First , we recall some basic definitions [l| and set the 
notation. The (Shannon) entropy of a classical discrete 
probability distribution p(a) = p a over a random variable 
A is defined by 



where the base of the logarithm is determined by conven- 
tions. Entropy of the joint probability distribution p(a, b) 
over AB, H(AB) is defined analogously. The Bayes the- 
orem relates it to the conditional probabilities, 



p(a,b) =p(a\b)p(b) =p(b\a)p{a), 



(2) 
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where p(a\b) is a conditional probability of A = a given 
that B = b. The conditional entropy of A 

H(A\B) = Y,p b H(A\b) = - 5^p(o, b) \ogp{a\b) (3) 



is a weighted average of the entropies of A given a par- 
ticular outcome of B. 

Correlations between two probability distributions are 
measured by the symmetric mutual information. It has 
two equivalent expressions, 



I(A : B) = H(A) + H(B) - H(A, B), 



(4) 



and 



J(A : B) = H(A) - H(A\B) = H{B) - H(B\A). (5) 

Quantum-mechanical (von Neumann) entropy [3J is de- 
fined as 



S(p) = -trplogp. 



(6) 



It minimizes the Shannon entropy of probability distri- 
butions that result from rank-1 positive operator- valued 
measures (POVMs) that are applied to the state p on the 
Hilbert space 'Ha- The minimum is actually reached on 
a probability distribution A that results from a projec- 
tive measurement II = {II a ,a = l,...d}, II a = 1, 
Ilallf, = <5 a fcII Q , which is constructed from the eigenpro- 
jectors of p, 



S(p) =min H{Af), 



(7) 



(1) that is, 



S(p)=H(Af), 



P = ^2Patt*a, p a >0,^2p a = l. 

a a 

(8) 

The expression A 1 ^ stands for a classical probability dis- 
tribution (of a measured parameter A) that is obtained 
from the state p under the POVM A. 

Quantum channels are abstracted as maps (typically 
completely positive ones) from the initial states pa to 
the final states px, where the space Wx may be either 
the same space Ha or different one [3] . Any orthonormal 
basis {\a}} oH-La defines a dephasing channel P, 

p^p' = p(p) = ^n aP n a , n a :=|a)(a|. (9) 
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Hence, taking the weighted average over the outcomes of 
a measurement II = {H a }a=i,...d A i s equivalent to send- 
ing the initial state through a dephasing channel with a 
superoperator P = Pn- 

The information function that expresses knowledge @ 
about a system of dimension d in the state p is a variety 
of the negentropy, 

K(p) = logd-S(p), (10) 

It has two related operational interpretations. A 
Maxwell's demon can draw K(p) work from a single heat 
bath using a Szilard engine [13| by using pure states. We 
will discuss the demons in Sec. III. 

On the other hand, K(p) determines a conversion rate 
between pure and mixed states. By allowing arbitrary 
unitary operations, by adding of maximally mixed ancil- 
las and by taking partial traces (and thinking in terms 
of qubits, log 2 = 1), it is possible to perform two tasks 
with asymptotically perfect fidelity. First, given n copies 
of the state p one can obtain nK(p) qubits in a pre- 
determined pure state. This is done by performing the 
usual quantum data compression, but instead of discard- 
ing the then-redundant pure qubits, it is the signal that is 
discarded [6\. Second, by taking nK(p) pure qubits and 
by diluting them with ancillas in the maximally mixed 
state, one produces n copies of p (TJ. 

Quantum discord stems from the fact that the clas- 
sical mutual information can be extended to quantum 
states in two inequivalent ways, following either Eq. Q 
or Eq. (O. In Sec. II, we introduce the three discord mea- 
sures and discuss some of their properties. Their role in 
the efficiency of different Szilard's engines is explored in 
Sec. Ill, and, in Sec. IV, we discuss their relationship to 
the local distinguishability of orthogonal states. 



II. QUANTUM DISCORD 

The first expression for mutual information has an ob- 
vious quantum generalization, 

I(pab) ■= S(pa) + S(pb) - S(pab), (11) 

and represents the total amount of quantum and classical 
correlations frl [iH Il6| . 

A. Conditional "state" definition 

To obtain a quantum version of J (A : B), it is neces- 
sary to determine a conditional state of the subsystem 
B. If the objective is to preserve the equivalence of two 
definitions in the quantum domain, I(pab) = J(pab), 
then the conditional entropy can be introduced |17| as 

S(pb\a) = -trpAslog/OsiA, (12) 



The positive operator pb\a is defined through 
PB \A := hm (p^iPA ® l B )- 1/n ) 

n— too 

= exp(-logp A ® Is + log pab), (13) 

where the inverse of pa is defined on its support. It 
does not usually have a unit trace, and when pa C3> Is 
commutes with pab it reduces to 

Pb\a = Pab{pa ® Is) -1 - (14) 
We will return to this quantity at the end of this section. 

B. Three versions of the discord 

Given a complete projective measurement II on A, a 
quantum definition of J follows its interpretation as the 
information gained about the system B from the mea- 
surement on A 

J nA (pab) := S( PB ) - S( PB \n A ), (15) 
where the conditional entropy is now given by 

S( PB \U A ) := 5>„S(p B |nJ. (16) 

a 

The post-measurement state of B that corresponds to the 
outcome A = a is 

p B \U a = (II a <g> iBpAB^a ® 1b)/Po> Pa = tl p A U a . 

(17) 

The state of B remains unchanged, 

PB = tTApAB = y^PapB\Il a - (18) 
a 

Unlike their classical counterparts, the quantum ex- 
pressions are generally inequivalent and I (pab) > 
J n (pab) The quantum discord as defined in 

[9( is the difference between these two quantities, 

Df(pAs) := S( PA ) + S( PB \Il A ) - S(pab)- (19) 

Its dependence on the measurement procedure is removed 
by minimizing the result over all possible sets of II, 

D A (p AB ) := min Df A (pab)- (20) 

Similarly, 

Ji(Pab) ■= max J nA (p AB ). (21) 

This definition of the discord has its origins in the stud- 
ies of the measurement procedure and pointer bases, thus 
projective measurements are natural in this context. It is 
possible to define the discord when the difference is min- 
imized over all possible POVM A [lfj. However, unless 
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stated otherwise we restrict ourselves to the projective 
measurements of rank 1. 

An explicit form of a post-measurement state will be 
useful in the following text. We denote this state as 
p' x = p x , where the subscript X stands for A, B, or 
AB, and use the former expression if it does not lead to 
confusion. After a projective measurement Il A , the state 
of the system becomes 

p'AB=Y.P- n -®P a B' ( 22 ) 
a 

where p a and p% = p B \n a are given by Eq. (fTT|) . and the 
states of the subsystems are 

PA = ^PaUa, PB= p' B = ^PaP a B , (23) 

a a 

respectively. 

The discord of the state pab is zero if and only if it 
is a mixture of products of arbitrary states of B and 
projectors on A [|, 

PAB^^Pa^a® P B ^ Pa > 0, J^p = 1. (24) 

a a 

By using this decomposition a nd p roperties of the en- 
tropy of block-diagonal matrices [12| we can identify 

J uA (p AB )=I(p' AB ), (25) 

because S{p' A ) = H(A^) and 

S(p' AB ) = H(A u p )+S(p B \Ti A ). (26) 

The discord is not a symmetric quantity: It is possible 
to have states with D A (p AB ) 7^ D\(pab)- A subclass of 
separable states that satisfy D A = D± = is of the form 

P C AB = Y, W " bU " ® P b' ( 27 ) 
ah 

where P B is a set of projectors on H B , and consists of 
classically correlated states in the sense of [Ts( | . 
Another possibility is to set 

jf := S( PA ) + S( PB ) ~ [H(A") + S( PB \U A )} 

= S(p A ) + S(p B )-S(p' AB ), (28) 

arriving at the quantum discord as defined in [l9| 

D A (pab) := mm[H(A^) + S(p B \U A )] - S(p AB ), (29) 

where the quantity to be optimized is a sum of post- 
measurement entropies of A and B. By using Eq. (J7J, 
we see that Di < D 2 - It is also easy to see that D% = 
<=> D 2 = 0. By using Eqs. d22J) and ([Ml), we obtain a 
different expression for D 2 : 



Since the definition of the discord(s) involves optimiza- 
tion, the analytic exp ressions are known only in some 
particular case [H, [HlHH. Moreover, typically it is im- 
portant to know whether the discord is zero or not, while 
the numerical value itself is less significant. 

It follows from Eq. (|24p that if the spectrum of a re- 
duced state p A — ^2 a p a n a is non-degenerate, then its 
eigenbasis gives a unique family of projectors II that re- 
sults in the zero discord for pab- Hence, a recipe for 
testing states for zero discord and for finding the opti- 
mal basis is to trace out a subsystem that is left alone 
(B), to diagonalize pa and to calculate the discord in the 
resulting eigenbasis. 

If the state pa is degenerate, a full diagonalization 
should be used. For the state with the form of Eq. (|24l) . 
each of the reduced states p% can be diagonalized as 

/>B=E r ? P a> Pa P a = Sbb ' P a- ( 31 ) 

b 

The eigendecomposition of the state pab then easily fol- 
lows. Writing it as 

Pab =^v?n a ®? Q 6 , (32) 

a . b 

it is immediate to see that its eigenprojectors are given 
by H a <S>Pa- Hence, if ps has a degenerate spectrum, but 
Pab has not, the structure of its eigenvectors reveals if it 
is of a zero or nonzero discord. Hence we established 

Property 1. The eigenvectors of a zero discord state 
D a {pab) = satisfy 

PA B \ab) =r ab \ab) 7 => \ab) (ab\ = Il a ® P b a . (33) 

□ 

This consideration leads to the simplest necessary con- 
dition for zero discord (first noticed in [2lj): 
Property 2. If D A (p AB ) = 0, then ' 

[pa®1b:Pab] =0. (34) 

Hence a non-zero commutator implies D a (pab) > 0. □ 
Naturally, if the state has zero discord, and the eigen- 
basis is only partially degenerated, we can use it to reduce 
the optimization space. On the other hand, the diago- 
nalizing basis is not necessarily the optimal basis H 
or n that enters the definition of D\ or D 2 , respectively. 
Consider, for example, a two-qubit state 

Pab = 1(1 ab + bcr z A ® 1 B + ca A ® (t b ), (35) 

where a x are Pauli matrices on the relevant spaces, X — 
A, B, and the constants b and c are restricted only by 
the requirements that pab is a valid density matrix. For 
this state p B = 1/2 and pa — diag(l + 6, 1 — b)/2. After 
the measurement in the diagonalizing basis Ii z = ((1 + 
<7 z )/2, (I — <t z )/2) the conditional state of B becomes 



Df(pAB)^S(pf B )-S(pAB). 



(30) 



PB\n* ± = 1/2, 



(36) 



4 



and the conditional entropy is maximal, S(p B \H z ) = 
log2. 

On the other hand, in the basis IP = ((1 + <r x )/2, (1 - 
a x )/2) the probabilities of the outcomes are equal, p + = 
P- = 1/2, but the post-measurement states of B are 
different from the maximally mixed one, 

p m% = §(l±cO, (37) 

so the entropy S{p B \W) > S(p B \U x ). 

This discrepancy motivates us to define a new version 
of the discord, which is useful if the eigenvectors of pA 
are not degenerate, 

D A (p AB ) := S(p A ) - S(pab) + S(p B \n A ), (38) 

where Il A is the set of eigenprojectors of pa- Otherwise 
it can be introduced using the continuity of entropy in 
finite-dimensional systems [12J. By applying Eq. ((7} to 
the subsystem A we find that D 3 simultaneously holds 
the analog of Eq. (j2"5j) , 

Mpab) = I(Pab), (39) 

and of Eq. §U\), 

D 3 (p AB ) = S(p%) - S(pab). (40) 

We also arrive to the following ordering of the discord 
measures: 

D A < d a < D A . (41) 

There are several important cases when the measures of 
discord coincide. Most importantly, they vanish simulta- 
neously: 

Property 3. D x = & D 2 = ^ D 3 = 0. 
The proof follows from Eas.(|23| and ([ST]). □. 

On pure states the discord is equal to the degree of 
entanglement, 

D A {4> AB ) = S{4> a ) =E{^ab), i = l,2,3. (42) 

Discord is also independent of the basis of measurement 
if the state is invariant under local rotations @ . Finally, 
if A is in a maximally mixed state, then D A — D 2 ■ 

These coincidences make it interesting to check when 
the discords D\ and D 2 are different. By returning to 
the measurement-dependent versions of the discords, we 
see that 

Df(pAB) = Df(pAB) - [H(A?) S(p A )}. (43) 

Assume that (pab) reaches the minimum on the set 
of projectors II, which are not the eigenprojectors of pa- 
In this case H(A^) — S(pa) > 0, so we can conclude that 
the strict inequality D A < D 2 holds, because 

D?0>ab) < Df(pAB) (44) 

=D 2 a { Pa b) - [H(A$) - S( PA )} < D£( PA b). 

For example, the state of Eq. (|3"5j) . with b = c = 4, 
satisfies Df « 0.05, w 0.20 and Df w 0.21. 



C. Relations with other quantities 

Quantum discord (pa h) is a concave function over 
the set of all POVMs A A [llj and the minimum is reached 
on a rank-1 POVM that consists of linearly independent 
operators [22| . The easiest way to obtain this result is to 
note that the set of all POVMs is convex, and the mini- 
mum of a concave function over a convex set is obtained 
on its boundary. 

The states of zero discord are nowhere dense in the set 
of all states [2l|. Nevertheless, it is obvious that even 
double-zero discord states of Eq. (|3"2"f convexly span the 
set of all states. 

The operator Pb\a has a closed form on the states of 
zero discord. Property 1 allows to write it as 



Pb\a = 




= Y J ^a®p a B , (45) 

a 

which indeed results in the conditional entropy 

S(pb\a) = £ w a S(p%) = S( PB \U A ). (46) 

a 

In general it follows from the definitions that 

S(p BlA ) = S(p B \U A )-Df A (pAB), (47) 

in any basis, not only in the optimal one. 

In the paradigm of closed local operations Q , Alice and 
Bob are allowed to perform arbitrarily local unitaties and 
projective measurements, and Alice can send her system 
to Bob via a dephasing channel. In the one-way ver- 
sion only a single use of the channel is allowed. Since at 
the end of the operation both system are accessible to 
Bob, the discord D 2 was identified with one-way quan- 
tum deficit @, 

A^(pab) = rmnSipfs) - S( P ab) = D A (p AB ). (48) 

Operationally, it expresses the fact that a naive one- 
way purification strategy consists of Alice performing the 
measurement II and announcing her results to Bob (or, 
equivalently, sending her part of the state individually 
through the channel Pfj). In this case the purification 
rate is given by K 2 (pab) = \og d A d B - S{p^ B ). 

However, it is the discord D A that gives the optimal 
efficiency of a purification in this context. If Alice is 
allowed to borrow pure states (that are returned at the 
completion of the protocol) and use block encoding prior 
to (individually) sending her particles and Bob, then [24| 
the optimal rate is 

K^(pab) = \ogd A d B + Hp^b) - S(pa) - S(p B ), (49) 



5 



so using Eq. (f25j) we see that K^(pab) 
J\(pab) ~ S(p A ) - S(p B ), or 



log dj^ds 



>A ) 
K(p AB ) 



K^(p AB ) 



AB) 



(50) 



Since the additivity of Jj 4 was shown to be equivalent 
to several other additivity problems [Hj], including the 
Holevo capacity of quantum channels, and the latter was 
finally disproved [251 ] . block processing will improve the 
efficiency for entangled pab- 

A symmetrized version of J3 involves (projective) mea- 
surements on both sides [23J and is given by J^ yu1 (pab) — 

/(p^® n * ). The symmetrized discord 
I (Pab) 



n s y m •- 



4 ym (p 



AB 



(51) 



is called the measurement induced disturbance and serves 
as another upper bound on the discord. 

III. MAXWELL'S GOBLINS 

Maxwell's demon [2(| is a "being whose facilities are so 
sharpened" as to enable him to challenge the 2 nd law of 
thermodynamics. Modern exorcism mostly focuses on his 
information-processing ability, with information erasure 
cost balancing the books and keeping the 2 nd law intact. 
Quantum logic and quantum correlations introduce new 
subtleties into this discussion (l3| . 

A typical setting is provided by a (quantum) Szilard's 
model [27j] , in whose original form the demon operates a 
heat engine with one-particle working fluid. For our pur- 
poses it is enough to consider only the work-extracting 
phase of the cycle and ignore the the resetting of the 
demon's memory. The optimal work extracted from a 
system of a dimension d in a known state p at the tem- 
perature T is on average 



W+ = kT(\ogd- S(p)) = kTK(p), 



(52) 



where k is Boltzman's constant adjusted to the base of 
the logarithm. 

For a bipartite state pab the benchmark performance 
W + (pab) is achieved by a fully quantum (non-local) de- 
mon Charlie that can perform arbitrary quantum opera- 
tions on the system. We compare his performance with 
actions of two local goblins of lesser power. Alice and 
Bob are local goblins that can perform only local oper- 
ations on their subsystems. They may have only partial 
information about the state pab and may not be able to 
communicate freely. 

The work that is extracted by Alice and Bob that are 
aware of their respective states pa,b but are not allowed 
to communicated is 



W L = kT{\ogd A d B - S(p A ) - S( PB )), 



(53) 



so the difference of the extracted work by a global demon 
and local non-communicating goblins is given by the mu- 
tual information, 



A much more interesting scenario that actually moti- 
vated the introduction of D 2 was proposed in In 
this setting both Alice and Bob know the state pab and 
Alice can communicate to Bob the results of her mea- 
surement. She chooses her measurement n in such a way 
as to maximize the extracted work 

W 2 = [\ogd A - H(A*)} + [\ogd B - S( PB \U A ) 

= logd A d B -S(p T t B ), (55) 

through steering of Bob's state to PB\u a which on average 

make up for a higher entropy of p\ — J2a^aPA^a- 
Hence the minimal difference between the work extracted 
by the goblins and the work extracted by the demon is 
given by 



A 2 W 



kTD A (p 



AB 



(56) 



In this setting Alice and Bob are essentially performing 
the purification protocol from Sec IIC. 

An operational meaning of D3 is clarified in the setting 
where Alice is still able to report her results to Bob, but 
has less knowledge than in the original example. Namely, 
Alice knows only pa, while Bob is aware of the entire 
state pab- In this case the best Alice can do is to perform 
the measurement in the eigenbasis of p, and tell her result 
to Bob. Then on average the gain is 

W 3 = kT(logd A -S(p A ))+kT(logd B -S(p B \IL A )), (57) 

so the difference in the extracted work is now 



A 3 W = kTD A 



(P 



AB) 



(58) 



IV. 



LOCAL DISTINGUISHABILITY AND 
DISCORD 



Since zero discord is thought to represent the absence 
of classical correlations, it is interesting to investigate 
the following question. Consider a set of pure orthogonal 
bipartite states, each of which may have a different prior 
probability, with the ensemble density matrix pab ■ Does 
the value of D(pab) tell us something about the ability 
to perfectly distinguish these states by local operations 
and classical communication (LOCC)? 

As exhibited Table I , there is no relation between 
D(pab) and local distinguishability. First, it was shown 
in |28[ that a certain set of nine 3x3 product orthogonal 



states cannot be perfectly distinguished by LOCC. These 
"sausage" (or "four and a half tatami teahouse") states 
■ • ■ IV'g) are 



A L W := W + — Wl — kTI(p AB ) 



(54) 



|1)<8|1>, |0)(g)|0±l)/v / 2, |2)®|1±2)/V2, 

|1±2)®|0)/V5, |0±1)|2)/V2, (59) 

Their equal mixture is the maximally mixed state pab = 
1/9 that obviously has a zero discord, D a (pab) = 
D B ( PAB )=0. 
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Second, it was shown [29| that any two orthogonal (en- 
tangled or not) states can be perfectly distinguished by 
LOCC. Consider a mixture of the Bell states |^ ,± ), 

Pab = a|* + )(* + | + (1 - o)|*-)<*-|, < a < 1. 

(60) 

The discord of such a state can be calculated analytically. 
It equals to 

D if( a ) =a\og 2 a- (1 - o)log 2 a + l, (61) 

which vanishes only for the equal mixture a = i. 

Two other cases are easy. A pair of bi-orthogonal prod- 
uct states, such as |0)|0) and |1)|1) results in a zero dis- 
cord. Mixing nine teahouse states with different weights 
may result in a non-zero discord. For example, giving 
\ipg) and ^7) weights which are twice as high as the 
rest of the states results in a mixture pab for which 
\fiA <S> Is , Pab] 7^ 0, thus implying a non-zero discord. 



TABLE I: Local measurability vs. discord 



States 


Discord 




Locally 










measurable 


9 teahouse states, equal weights 


D A 


= D B 


= 


no 


2 product bi-orthogonal states 


D A 


= D B 


= 


yes 


2 entangled orthogonal states 


D A 


> 




yes 


9 teahouse states, unequal weights 


Dt 


> 




no 



vanish simultaneously, and it is easy to check if this is 
the case, using Properties 1 and 2. The discord mea- 
sures D2 and D3 have a natural physical interpretation 
in terms of the work extracted by a pair of Maxwell's 
demons that operate on a bipartite system under differ- 
ent sets of restrictions. On the other hand, depending on 
the imposed restrictions, it is either D\ or D 2 that can 
serve as a measure of quantum deficit in the state. 

Despite its intuitive appeal quantum discord D(pab) 
is not an indicator of wether the ensemble of states pab 
is made up of states which are distinguishable by LOCC. 
Zero quantum discord allows for a completely-positive 
dynamics of an open system even if there are initial corre- 
lations with the environment Q . In a forthcoming paper 
[301 ] it will be shown that it is not enough for practical 
quantum tomography. 

We discussed the discord in the states on finite- 
dimensional spaces. One obvious difficulty in the gen- 
eralization to the continuous variables is in the infinite- 
dimensional optimization that is involved in the defini- 
tion of D\ and D 2 - It is possible that the measure D3 is 
more suitable in the latter case. 
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